Back

G3 Genes|Genomes|Genetics

Oxford University Press (OUP)

Preprints posted in the last 90 days, ranked by how well they match G3 Genes|Genomes|Genetics's content profile, based on 351 papers previously published here. The average preprint has a 0.11% match score for this journal, so anything above that is already an above-average fit.

1
Population genomics of Drosophila pseudoobscura

Manat, Y.; Zheng, Z.; Kritzell, C. A.; Gonzales, C. A.; Meisel, R. P.

2026-02-04 genomics 10.64898/2026.02.02.703370 medRxiv
Top 0.1%
50.2%
Show abstract

Drosophila pseudoobscura is a long-standing model organism in evolutionary genetics because natural populations segregate for an inversion polymorphism on the third chromosome. In addition, D. pseudoobscura has a neo-X chromosome that was formed by an X-autosome fusion, which segregates for a sex-ratio drive allele. Previous genome-wide studies of DNA sequence variation in D. pseudoobscura have focused on individual chromosomes or did not use chromosome-scale reference genomes. To address these shortcomings, we generated a new D. pseudoobscura population genetic resource by sequencing the genomes of over 60 inbred lines sampled across the species geographic range in North America. Using these data, we examined patterns of nucleotide diversity and population structure across the entire genome. Tajimas D was negative across most of the genome, consistent with a recent population expansion. However, there was substantial heterogeneity of D across chromosomes, suggesting distinct evolutionary dynamics across the genome. We found no strong evidence of population structure across most chromosomes, consistent with a near panmictic population. In contrast, we identified population structure on the third chromosome, which we attributed to the inversion polymorphism and used to infer the arrangements carried by the strains we sampled. Our analysis therefore demonstrates that tests for population structure can identify polymorphic chromosomal rearrangements. The population genomic data we have collected is publicly available and will support future research on genome evolution, local adaptation, and sex chromosome evolution in D. pseudoobscura.

2
A mdg4 Retrotransposon Screen for X-linked Female Sterile Alleles and its Relationship with the Transcription Factor OVO

Benner, L.; Oliver, B. C.

2026-02-14 genetics 10.64898/2026.02.12.705638 medRxiv
Top 0.1%
18.5%
Show abstract

In the germline, the mdg4 retrotransposon integrates in close proximity to the location of OVO DNA binding motifs, suggesting that insertion bias is driven by the OVO transcription factor. A classical genetic example of this is the reversion of the dominant female-sterile allele, ovoD1, by the transposition of mdg4 into the ovo promoter where OVO protein binds. We wanted to take advantage of this relationship and determine if we could recover female sterile alleles along the X chromosome due to mdg4 insertion, with the hypothesis that these would be genes that OVO binds and transcriptionally regulates in the germline. We mobilized the mdg4 retrotransposon with the use of mutants for the lncRNA gene flamenco (flam) and recovered 17 recessive female sterile alleles out of a total of 1,192 chromosomes screened. We identified 11 complementation groups, for which a mdg4 insertion was responsible for female sterility in 7 groups. Notably, a complementation group consisting of 6 alleles was found to be the result of a Doc transposable element insertion into the gene Grip91 and is potentially evidence for a Doc insertional hotspot in the genome. Our screen also uncovered that 7/17 recessive female sterile chromosomes contained multiple transposable element insertions indicating that flam- females derepress numerous transposable elements that can lead to multiple transposon insertions along a single chromosome, as has been suggested previously. Altogether, we found that mdg4 did have an insertion bias into OVO bound regions of the genome that can result in female sterility, however, this was the case for a minority of the female sterile alleles recovered with this method. Article SummaryThe retrotransposon mdg4 preferentially inserts near binding sites of the female germline transcription factor OVO in Drosophila melanogaster, most notably at the ovo locus itself. We leveraged this relationship to screen for X-linked recessive female-sterile mutations generated by mdg4 mobilization in flamenco mutant females. From 1,192 chromosomes, we recovered 17 female-sterile alleles across 11 complementation groups. mdg4 insertions were significantly enriched in OVO-bound regions but accounted for only a subset of sterility phenotypes, revealing substantial background mutagenesis by other transposable elements. These results refine the OVO-mdg4 relationship and highlight both the promise and limitations of transposon-based genetic screens.

3
Allelic Association Analyses: Estimation Recommendations

Weir, B. S.; Goudet, J.

2026-01-30 genomics 10.64898/2026.01.26.701864 medRxiv
Top 0.1%
18.2%
Show abstract

We review the rich literature on the estimation of measures of inbreeding, relatedness and population structure, beginning with Sewall Wrights F-statistics and moving onto the descriptive statistics of Masatoshi Nei and Clark Cockerham. The current availability of genome-level single nucleotide variant data is allowing for sophisticated treatments of inferred identity by descent segments and inferred ancestral recombination graphs. Underlying such disparate methods is an emphasis of characterizing the descent status of alleles within and between individuals and populations and we have found allele-sharing statistics a convenient framework for examining the differences and similarities among different estimators. We have been able to resolve some long-standing reported differences among estimators, especially those involving the work of Nei. In the course of our algebraic and empirical treatment of descent measure estimation we have been able to formulate a set of five recommendations. Following the early work of Sewall Wright, we recommend 1. State that descent measures for pairs of alleles are relative to values in a reference set of allele pairs. With this view, we recommend 2. Use estimators that preserve descent measure rankings over different reference sets. Allele-sharing estimators satisfy this recommendation. Reducing genotypic data to allelic data has the benefit of reducing dimensionality, but we recommend 3. If genotypic data are available, avoid having to assume Hardy-Weinberg equilibrium by not reducing them to allelic data. Partly as a consequence of working with genotypic data, we recommend 4. Recognize that allele frequencies do not need to be estimated. Not estimating allele frequencies prevents the confounding of descent estimates for target pairs of alleles by the status of all pairs in a reference set. On the basis of both theoretical and empirical results, finally we recommend 5. Consider both inbreeding and kinship when estimating either one. It is difficult to envisage a natural population with relatedness but no inbreeding, or vice versa.

4
Transposons contribute to splice-isoform diversity in the Drosophila brain

Choucri, M.; Treiber, C. D.

2026-01-26 genomics 10.64898/2026.01.22.701052 medRxiv
Top 0.1%
18.2%
Show abstract

The extraordinary complexity of the brain depends in part on the vast diversity of mRNA isoforms it expresses, often in a cell-type specific manner. In a recent study, we found that intronic transposable elements (TEs) are spliced into neural transcripts and diversify the splice isoform repertoire of neurons and glia (Treiber and Waddell, 2020). A recent paper by Azad et al. revisits these findings using their TIDAL analysis pipeline applied to our published data (Azad et al., 2024). Their analysis did not find any of the splicing reads we reported, and although they used RT-PCR to test seven of the 264 TE-gene pairs we had previously reported, they failed to validate TE-gene splicing in any of them. Here, we conduct a quantitative analysis of TE exonisation and show that intronic TE insertions are frequently recruited as alternative exons, with exon usage ranging from rare events to near-complete inclusion in transcripts. We implement this analysis in an improved version of our TEChim software, and present clear support for TE-gene splicing at the seven loci tested by Azad et al. We also identify methodological issues in the experimental and computational design of the Azad et al. study that likely explain their failure to detect TE-gene chimeras, while demonstrating that TE-gene splicing can be detected by RT-PCR under appropriate experimental conditions. Together, our data demonstrates that TE splice isoforms are not rare artefacts but measurable and biologically relevant features of the Drosophila brain transcriptome that may contribute to the molecular complexity and functional adaptability of the brain.

5
Australian giant kelp genome assemblies show distinct Southern Hemisphere genetics

Scharfenstein, H. J.; Carroll, A.; Iha, C.; Schwoerbel, J.; Jordan, R.; Willis, A.

2026-02-21 genomics 10.64898/2026.02.20.707121 medRxiv
Top 0.1%
17.1%
Show abstract

Giant kelp, Macrocystis pyrifera, occurs across northern and southern Hemisphere temperate coasts and is at high risk from ocean warming. Few giant kelp forests remain across the Southeast Australian shelf, while a handful of forests are actively being restored. Genomic resources can greatly aid in the conservation of remnant populations and enhance restoration efforts. Reference genomes are a fundamental resource as they are a prerequisite to, or enhance, many analyses used in conservation genomics. A single reference genome is available for giant kelp, assembled from a Californian haploid specimen. However, increasing evidence of genetic divergence between Northern and Southern Hemisphere populations highlights the need for regionally representative reference genomes. Here, we present two genome assemblies from the diploid vegetative tissue of Australian giant kelp specimens. We performed de novo genome assembly using long-read sequencing (PacBio HiFi and ONT R10.4 Simplex) and scaffolded the assemblies with the ONT reads, assembling 98-99% of the genomes into 35 pseudo-chromosomes. Genome sizes ranged from 528-534 Mbp, with BUSCO completeness scores of 97-98% and QV scores of 51-52. Genome annotation identified 17,565-17,800 genes in the Australian genomes. Genomic divergence between the Australian and Californian giant kelp genomes was seven-fold greater than between Australian genomes (1.5% vs 0.2%), supporting a Northern-Southern Hemisphere genetic divergence. Functional divergence was also observed between Australian and Californian genomes, reflected by differing patterns of enrichment in gene ontologies linked to energy metabolism, proteostasis and stress responses. These two new genome assemblies will serve as valuable resources for ongoing research into Southern Hemisphere giant kelp genetics, while providing the basis for genomic-guided conservation and restoration of remnant giant kelp forests in Australia.

6
A pilot study for whole proteome tagging in C. elegans

Eroglu, M.; Hobert, O.

2026-02-10 genetics 10.64898/2026.02.09.704846 medRxiv
Top 0.1%
14.6%
Show abstract

Tagging all proteins encoded by an animal genome with a fluorescent tag would open many windows to the discovery of unexpected patterns of protein expression and localization. To scale such an approach, it would be beneficial to introduce multiple, spectrally distinct fluorophore tags in parallel. As a first step in this direction, we undertook a pilot study in the nematode C. elegans, in which we set out to tag 30 different genetic loci with three different fluorophores, with 3 tags being introduced at a time. By choosing essential genes, predicted based on transcriptomics to cover a range of expression levels, we explore issues relating to disrupting gene function and visibility of tagged proteins. We demonstrate that such a tagging approach is highly efficient and indeed reveals unanticipated patterns of cellular sites of expression, as well as subcellular protein localization. We hope that this pilot study will motivate attempts to scale this tagging approach to more loci and, ultimately, the whole genome.

7
Identification of transgene insertion sites via short- and long-read whole genome sequencing.

Kaplan, L.; Edgerton, S. J.; Mahoney, B. D.; Ray, C. A.; Reh, T. A.

2026-02-05 molecular biology 10.64898/2026.02.05.704064 medRxiv
Top 0.1%
14.5%
Show abstract

Transgenic mouse lines are essential to uncover organ or system level genotype-phenotype relationships. The generation of such lines via transgene addition may lead to the insertion into unknown genomic loci potentially leading not only to the disruption of native genes but also attenuation of transgene expression. Additionally, this often results in the inability to determine transgene zygosity which in turn complicates breeding and interpretation of experimental results. In this study we present two whole genome sequencing based pipelines that allow the identification and genotyping of even complex multi transgenic inserts. As they use widely available reagents and bioinformatic tools, they can easily be applied to develop genotyping strategies in potentially any species.

8
Discovery and in vivo characterization of novel TOG domain-containing proteins using C. elegans

Wethekam, L. C.; Mittal, T.; Maddox, A. S.

2026-02-24 cell biology 10.64898/2026.02.23.707544 medRxiv
Top 0.1%
14.3%
Show abstract

The proper assembly, architecture, and maintenance of microtubule, actin and other cytoskeletal networks require regulation by various polymer binding proteins. Microtubules, rely on both the tubulin building blocks, but also many tubulin- and microtubule binding proteins. TOG domain-containing proteins comprise one family of tubulin-binding proteins that regulate microtubule dynamics. Here we identify two previously uncharacterized TOG domain-containing proteins (TOD-1 and TOD-2) in the nematode, C. elegans. These proteins are unique in that they are members of the XMAP215 family but contain reduced numbers of TOG domains and, in one case, a divergent TOG domain. TOD-1 and TOD-2 are expressed in and contribute to the normal function of sperm. The single TOG domain of TOD-1 and both TOG domains of TOD-2 are predicted to bind free tubulin dimers and not microtubule lattice. Deletion of either tod gene resulted in an increased laying of unfertilized oocytes. Inspection of mutant hermaphrodites revealed a premature onset of sperm migration failure. Together, these findings suggest that C. elegans requires regulation of tubulin dimers and/or microtubules for sperm localization and function. The amoeboid movement of C. elegans sperm has been considered microtubule-independent, our results open a new avenue of research into their unique motility.

9
Uncertainty-aware breeding decisions: MCMC-based optimum contribution selection increases breeding decision robustness

Ahlinder, J.; Waldmann, P.

2026-03-18 genetics 10.64898/2026.03.15.711440 medRxiv
Top 0.1%
14.2%
Show abstract

Current optimum contribution selection (OCS) implementations use point estimates of estimated breeding values (EBVs), potentially leading to suboptimal selections when individuals have uncertain genetic evaluations. We developed a framework assessing how EBV uncertainty affects OCS decisions through MCMC-based approaches using the COSMO optimizer in Julia, evaluated on Norway spruce (Picea abies, n=5,525) and Loblolly pine (Pinus taeda, n=926) populations. Agreement between point estimate (MAP-OCS) and MCMC-OCS was surprisingly low: mean overlap of only 26.6 (4.8) individuals in Norway spruce genotyped subpopulation and 14.1 (3.6) in full pedigree, with Loblolly pine intermediate at 16.0 (9.6). Despite this low individual-level agreement, selection frequency across MCMC iterations corresponded well with EBV rankings (Spearman{rho} = 0.782 for Norway spruce), confirming that higher-EBV individuals were preferentially selected under posterior uncertainty. To comprehensively quantify uncertainty impacts, we employed two complementary metrics: individual robustness scores measuring genetic gain stability upon candidate removal, and population-level contribution distribution metrics capturing concentration of genetic gain across selected individuals. Applying these metrics identified 25 high-risk individuals in Norway spruce and nine in Loblolly pine, and constrained exclusion of these individuals improved individual robustness by 16.5% in Loblolly pine (3.00% genetic gain loss) and 29.8% in Norway spruce (2.14% genetic gain loss). Our uncertainty-aware OCS framework successfully identifies unstable selections that may compromise long-term genetic gain, and we recommend assessing EBV uncertainty through posterior distributions and evaluating population-specific trade-offs when implementing uncertainty-aware selection strategies.

10
Introgression from the wild relative Manihot glaziovii on cassava (M. esculenta) chromosome 1 exhibits segregation distortion and no direct effect on dry matter

Villwock, S. S. C.; Rabbi, I. Y.; Ikpan, A. S.; Ogunpaimo, K.; Nafiu, K.; Kayondo, S. I.; Wolfe, M.; Jannink, J.-L.

2026-02-21 genetics 10.64898/2026.02.20.707074 medRxiv
Top 0.1%
14.1%
Show abstract

The cassava (Manihot esculenta) genome has two large introgressions from its wild relative M. glaziovii on chromosomes 1 and 4 that originate from historical hybridization efforts. The 10 Mbp chromosome 1 introgression has been increasing in frequency in African breeding populations due to its statistical association with higher dry matter content and root number. However, the region also exhibits suppressed recombination, hindering breeders ability to combine favorable glaziovii alleles with the cultivated esculenta background. Since homozygous introgressed lines are rarely selected for advanced trials, dominance effects have not been well-characterized. To analyze the effects of the introgression with higher resolution, we generated a population of over 5000 seedlings from crosses between heterozygous introgressed parents and screened for recombinants using ten KASP markers tagging glaziovii-specific alleles. An optimized subset of 453 lines was then selected and evaluated over two years for yield and vigor traits. Unlike previous studies, composite interval mapping and mixed linear models showed no significant associations between glaziovii alleles and dry matter content or root number. Small, opposing effects on clonal vigor were observed at different ends of the introgression. The region showed significant segregation distortion and enrichment of putative deleterious alleles. Genome alignment of M. esculenta and M. glaziovii assemblies did not show any major structural variants in the introgression region, suggesting that suppressed recombination is likely driven by sequence-level divergence rather than structural rearrangements. These results indicate that the glaziovii introgression does not directly contribute to dry matter, supporting the need for recombination and purging of the glaziovii introgression to aid cassava improvement. Plain language summaryA large chromosome segment from a wild relative of cassava is an important structural aspect in the cassava genome. Since the chromosome segment tends to be inherited as one block, its effects on cassava traits were not well resolved. Through genetic mapping at higher resolution, we identified that the wild segment impacts early vigor and does not appear to impact dry yield, as was previously thought. While there are no major structural differences between the wild and cultivated chromosome segments, their overall divergence seems to suppress the wild chromosome segment from pairing with the cultivated chromosome segment during reproduction. In the apparent absence of any major benefits from the wild segment, removing it from the breeding population may be beneficial. Core ideasO_LIA set of glaziovii allele-specific markers were designed to track the chromosome 1 introgression haplotype. C_LIO_LISegregation distortion suggests the presence of recessive deleterious or lethal alleles in the introgression. C_LIO_LIIncreased recombination is needed to purge deleterious alleles enriched in introgression region. C_LIO_LIThe glaziovii introgression was associated with slightly lower vigor rating and stem diameter. C_LIO_LIThe effects of the previously-identified glaziovii DM QTL were not detected in this population. C_LI

11
A Fluorescent Dauer Marker in Caenorhabditis inopinata Enables Comparative Analysis of Dauer-Inducing Mechanisms

Iitsuka, R.; Haruta, N.; Oomura, S.; Sugimoto, A.

2026-04-09 developmental biology 10.64898/2026.04.06.716796 medRxiv
Top 0.1%
12.7%
Show abstract

Dauer larvae are a dormant developmental stage in nematodes that is induced by a range of environmental cues. The molecular mechanisms that transduce these cues to regulate dauer entry have been well characterized in Caenorhabditis elegans, whereas those in other nematode species remain unclear. The closest known sibling species of C. elegans, Caenorhabditis inopinata, occupies a distinct ecological niche and shows an extremely low frequency of dauer formation by starvation in laboratory conditions, suggesting that it could serve as a useful comparative model for analyzing dauer-inducing mechanisms. To support such analysis, we generated a fluorescent dauer reporter, Cin-col-183p::mCherry, in C. inopinata based on a previously reported dauer-specific reporter in C. elegans. This reporter showed fluorescence specifically in the pre-dauer and dauer stages, but not in other developmental stages, indicating that it functions as a dauer-specific marker in C. inopinata. Using these marker strains, we compared the responses to high temperature and RNAi-mediated knockdown of insulin/IGF-1 pathway genes (daf-2, age-1, and pdk-1), and found that dauer induction differs mechanistically between C. elegans and C. inopinata. This dauer-specific fluorescent strain will be a useful tool for investigating the diversity of dauer-inducing mechanisms across nematode species. Article SummaryDauer is a dormant developmental stage in nematodes induced by environmental stress. Although its regulation is well studied in Caenorhabditis elegans, the mechanisms in other species remain unclear. Here, we developed a fluorescent dauer reporter, Cin-col-183p::mCherry, in Caenorhabditis inopinata, a close relative of C. elegans. The reporter was specifically expressed in pre-dauer and dauer stages, confirming its usefulness as a dauer marker. Using this strain, we found that responses to high temperature and insulin/IGF-1 pathway gene knockdown differ between C. elegans and C. inopinata. This reporter will help reveal diversity in dauer-inducing mechanisms across nematode species.

12
The genetic basis of resistance to pathogens in rainbow trout: a meta-QTL analysis

Rodriguez-Vazquez, R.; Karami, A. M.; Robledo, D.; Buchmann, K.

2026-02-14 genomics 10.64898/2026.02.13.705103 medRxiv
Top 0.1%
12.3%
Show abstract

Rainbow trout is affected by a broad range of pathogens causing large economic losses and animal welfare concerns. Marker-assisted selection can significantly enhance resistance to pathogens in a few generations, and to this end many studies have focused on identifying quantitative trait loci (QTLs) for resistance traits. The integration of accumulated genetic resources provides an opportunity to uncover important genetic variation and candidate genes crucially involved in rainbow trout immunity. Here, we present a comprehensive meta-QTL (MQTL) analysis based on the integration of 145 QTLs related to pathogen resistance. These QTLs were refined into 26 MQTLs, of which 15 were validated by genome-wide association studies (GWAS). The average confidence interval (CI) of these MQTLs was reduced by 2.03-fold compared to the initial QTL, improving mapping precision. Integration of GWAS results revealed regions along the rainbow trout genome pivotal for pathogen resistance, and a major region in chromosome 3, which could be used in marker-assisted selection. Further, among the validated MQTLs we identified a subset of high-confidence MQTLs, based on those supported by at least three initial QTL from more than two independent studies, with a percentage of variance explained greater than 8% and a LOD score higher than three. Gene annotation identified 11 unique candidate genes within these high-confidence MQTLs involved in immune pathways, encoding proteins involved in the regulation of immune responses, signalling pathways, receptor activity, and direct immune effector production. The MQTLs and candidate genes identified are valuable resources for advancing molecular breeding and unravelling the genetic basis of pathogen resistance in rainbow trout.

13
SIN3 Regulates Transcriptional and Longevity Responses to Glycolytic Perturbation in Drosophila melanogaster

Amarasinghe, A. P.; Pile, L. A.

2026-03-11 genetics 10.64898/2026.03.09.710676 medRxiv
Top 0.1%
10.1%
Show abstract

Cellular metabolism and gene transcription are closely linked. The conserved transcriptional regulator SIN3 acts as a scaffold for histone deacetylase (HDAC)-containing complexes and is crucial for development, stress resistance, and overall organismal health. SIN3 regulates metabolic gene expression in Drosophila cultured cells, however, an understanding of the extent of its role in coordinating responses to metabolic stress in whole organisms is incomplete. In this study, we explored how SIN3 controls glycolytic gene expression across developmental stages and under genetic and dietary disruption of glycolysis in Drosophila melanogaster. Focusing on four key glycolytic enzymes: phosphofructokinase (Pfk), enolase (Eno), pyruvate kinase (Pyk), and pyruvate dehydrogenase beta (Pdhb), we found that reducing Sin3A levels increases their expression in both larvae and adults, indicating that SIN3 plays a consistent role in balancing metabolic gene transcription. Genetic interaction experiments indicate that Sin3A interacts with Pyk and Eno, regulating transcription in a gene-specific manner. Disrupting glycolysis via genetic or dietary means alters glycolytic gene expression, and SIN3 modulates this response. These findings indicate that SIN3 functions as a metabolic sensor, regulating transcription in response to cellular metabolic stress. Additionally, we demonstrate that reducing Sin3A levels shortens Drosophila lifespan on both low- and high-sucrose diets, emphasizing the importance of SIN3 in longevity. Overall, these results show that SIN3 is a context-dependent regulator of glycolytic gene expression and lifespan in Drosophila, integrating metabolic signals with chromatin-based transcriptional regulation. SummaryTo survive and thrive, organisms must adapt to distinct metabolic inputs. We investigated the response of the conserved transcriptional regulator SIN3 to metabolic stress and its control of glycolytic gene expression in Drosophila melanogaster. By measuring glycolytic gene expression, testing genetic interactions, and assessing lifespan under genetic and dietary perturbations, we found that Sin3A knockdown elevates glycolytic gene expression in a gene-specific manner and decreases longevity. SIN3 also modulates transcriptional responses to disrupted glycolysis and influences lifespan under sucrose stress. These findings identify SIN3 as a context-dependent transcription regulator that links gene expression with organismal metabolic adaptation.

14
Genome-Wide Linkage Disequilibrium, Haplotype Block Structure, and Population Diversity in Nili-Ravi Buffalo (Bubalus bubalis)

Ahmad, A.; mustafa, h.; Khan, W. A.; Manan, A.; Anwer, I.; Akram, W.

2026-02-23 genomics 10.64898/2026.02.22.707281 medRxiv
Top 0.1%
10.1%
Show abstract

Linkage disequilibrium (LD) and haplotype block structure govern the resolution and utility of genomic selection, marker-assisted selection, and genome-wide association studies (GWAS) in livestock. We performed a comprehensive genome-wide characterization of LD decay, haplotype block architecture, and population diversity across all 24 autosomes in Nili-Ravi buffalo (Bubalus bubalis; n = 85), using 43,543 post-quality-control SNPs. Mean genome-wide r2 was 0.124 (median 0.074) and mean D was 0.540 (median 0.481), with LD half-decay at {approx}70 kb. A total of 133 haplotype blocks encompassing 721 SNPs were identified (Gabriel et al., 2002). Haploview analysis of nine chromosomes harbouring bTB resistance candidate genes revealed contrasting selection signatures: directional selection at innate immune loci (IFNG, TLR1; H < 0.55) versus balancing selection at adaptive immune loci (BoLA-DRB3, SP110; H > 1.0). Critically, BBU15 Block 3 (28.6 kb; OR52E5/NCR1 locus, 47.16 Mb) showed a genome-wide significant integrated haplotype score (iHS; -log1 0 p = 5.408), directly co-localising with the published bTB susceptibility QTL (Bermingham et al., 2014). The TAA haplotype (frequency 53.3%) at this block represents a candidate resistance-associated haplotype for marker-assisted selection. These findings provide essential parameters for SNP panel design and bTB resistance breeding in South Asian buffalo.

15
The distribution of fitness effects of new mutations in regulatory regions of the D. melanogaster genome

Daigle, A.; Marsh, J.; Kay, A.; Johri, P.

2026-03-03 evolutionary biology 10.64898/2026.03.01.708907 medRxiv
Top 0.2%
9.6%
Show abstract

Although non-coding regions play important roles in gene regulation and contribute to individual fitness, the precise distribution of fitness effects (DFE) of new mutations in these regions remains poorly understood. Here, we carefully compile experimentally validated regulatory regions in non-coding regions in the Drosophila melanogaster genome and identify putatively neutral sites near them. Incorporating a realistic genomic architecture that mimics the placement of the regulatory and coding regions, as well as a realistic heterogeneity in recombination and mutation rates across the genome, we use forward-in-time simulations to assess the power and accuracy of population genetics approaches that infer the DFE of new mutations in these regions. While the parameters of DFEs primarily comprising moderately and strongly deleterious mutations are estimated accurately, those of a DFE comprising mostly mildly deleterious mutations are misinferred. Applying these insights to three African D. melanogaster populations, we find that a large fraction of new mutations in functionally important non-coding regions are moderately deleterious, as opposed to strongly deleterious in coding regions. While the fraction of beneficial substitutions in regulatory regions (0.25-0.45) was also lower in coding regions ([~]0.5), our results suggest that non-coding regions contribute a majority of new deleterious mutations and beneficial substitutions in D. melanogaster populations. By incorporating both the genomic distribution and the inferred DFE of non-coding regions, we demonstrate that the effects of background selection across the genome are more accurately captured than with coding regions alone, highlighting the importance of considering selection on non-coding regions when interpreting patterns of genomic variation.

16
Spatial confinement of gene drives: Assessing risk of failure using global sensitivity analysis

Butler, C. D.; Lloyd, A. L.

2026-02-19 evolutionary biology 10.64898/2026.02.18.706608 medRxiv
Top 0.2%
8.5%
Show abstract

Gene drives allow pest populations to be genetically modified to reduce their harm on agriculture and human health. The genetic modification, or payload, spreads within a target population at rates exceeding normal Mendelian inheritance. While gene drives have demonstrated immense potential in laboratory populations, they present unique challenges. Foremost among these challenges is spatial confinement, or ensuring that the payload remains confined to target populations. However, there is an inherent tension between gene drive spread and spatial confinement: increasing the spreading efficiency of a gene drive increases the risk of escape, while engineering confinement mechanisms increases the risk of gene drive extinction. In this work, we explore spatial outcomes in gene drives designed for spatial confinement and the dependence of these outcomes on target organism dispersal and payload fitness cost. We use a stochastic spatial model to compute the probability of failure for each gene drive, and use techniques from global sensitivity analysis to quantify the contribution of dispersal and fitness cost to variance in gene drive performance. Our findings reveal how spatial outcomes are affected by key parameters, and how this sensitivity varies tremendously between different gene drives. These spatial properties can be used to classify gene drive behavior and are useful to determine suitability for a particular application.

17
Bacterial pyruvate metabolism regulates host insulin sensitivity in C. elegans

Mechan Llontop, M.; Nauta, K. M.; Gates, D.; Wang, X.; Cooper, J.; Burton, N. O.

2026-03-05 microbiology 10.64898/2026.03.04.709558 medRxiv
Top 0.2%
8.5%
Show abstract

Two different individuals can consume an identical diet but experience different physiological outcomes. While there are many potential mechanistic reasons for this, one increasingly recognized reason is that differences in an animals microbiome can lead to differences in the processing of dietary nutrients. Thus, though a diet might start out the same, how it is experienced by a host is dependent on their microbiome. While exciting, mechanistic studies of diet-microbiome-host effects are often limited by the lack of high throughput laboratory techniques to identify and define interactions between dietary metabolites, microbial metabolism, and host biology. We hypothesized that the model organism Caenorhabditis elegans is an advantageous animal model for rapidly identifying and genetically dissecting interactions between dietary nutrients, microbial metabolism, and host physiology. Here, we used an established model of the effects of dietary glucose on insulin resistant mutant animals (daf-2/IR mutants) to study how differences in bacterial metabolism influence the consequences of dietary sugars on animal physiology. We found that the effect of dietary sugars on daf-2 mutant physiology is dependent on how the microbiome metabolizes dietary sugars. We found dietary sugar suppresses multiple daf-2 mutant phenotypes in the presence of some bacteria but has no effect in the presence of others. To determine how bacteria mediate the effects of dietary sugars on host physiology we screened 5,000 transposon mutations in the canonical C. elegans dietary bacteria, E. coli OP50 for effects on animal insulin signaling. From this, we found that the effects of exogenous sugars on the phenotype of daf-2 mutant animals is dependent on the function of pyruvate dehydrogenase in bacteria and that the loss of bacterial pyruvate dehydrogenase genes (ex. aceE) is sufficient to mimic the effects of dietary sugars on dauer formation, longevity, and gene expression in insulin signaling deficient animals. Collectively, our findings further support the growing body of evidence that the effects of dietary nutrients on animal physiology can be influenced by the gut microbiome. In addition, these studies demonstrate the advantages of the C. elegans model system for studying 3-way diet-microbiome-host interactions that are difficult to dissect in other model systems.

18
High-efficiency targeted integration of extrachromosomal arrays in C. elegans using PhiC31 integrase

Rich, M. S.; Pellow, R.; Hefel, A.; Rog, O.; Jorgensen, E. M.

2026-01-21 genetics 10.1101/2025.11.11.687718 medRxiv
Top 0.2%
8.3%
Show abstract

Extrachromosomal arrays are unique chromosome-like structures created from DNA injected into the C. elegans germline. Arrays are easy to create and allow for high expression of multiple transgenes. They are, however, unstable unless integrated into a chromosome. Current methods for integration, such as X-rays and CRISPR, damage DNA and are low-efficiency. Here, we demonstrate that the viral integrase PhiC31, which mediates a non-mutagenic recombination between short attB and attP sequences, can be used for extremely efficient and targeted integration of arrays. In this method, a transgene, a selectable marker, and attP sites are injected into the gonad of a strain that (1) has an attB site in its genome, and (2) expresses PhiC31 in its germline. F1 extrachromosomal arrays are cloned, grown for multiple generations with selection, and then screened for homozygous array integrations. The procedure is simple, requires less time than screening for extrachromosomal arrays, and arrays can be screened for transgene function after stable integration. Arrays that transmit are integrated by PhiC31 with 50-95% efficiency, allowing for the isolation of many unique integrants from a single injection. Arrays can also be integrated at fluorescent landing pads and arbitrary sites in the genome. Using nanopore sequencing, we show that three new integrated arrays are between 1.6 and 18 megabases in length, assemble with large repeats, and can contain hundreds of copies of injected transgenes. We have built a collection of strains and plasmids to enable array integration at multiple sites in the genome using various selections. PhiC1-mediated Integration of Arrays of Transgenes (PhiAT) will allow C. elegans researchers to shift from using unstable extrachromosomal arrays to directly integrating arrays.

19
Wheat mutants lacking Starch Synthase 1 have altered starch composition and cell wall content

Trafford, K.; Fahy, B.; Gonzalez, O.; Pasquariello, M.; Ahn-Jarvis, J. H.; Mayne, J.; Kosik, O.; Lovegrove, A.; Warren, F. J.; Hazard, B.

2026-02-25 plant biology 10.64898/2026.02.24.707672 medRxiv
Top 0.2%
8.3%
Show abstract

Starch Synthase 1 (SS1) participates in the synthesis of amylopectin. To determine its role in hexaploid bread wheat, we selected and combined TILLING mutations for each homoeologue to create two independent SS1-deficient lines. The lines, which have different combinations of ss1 mutations, both lacked SS1 protein. Both lines exhibited mild but significant changes to starch phenotype, including fewer short amylopectin chains, a slight increase in B-type starch granules and a modest increase in amylose content. Lack of SS1 also led to changes in the thermal properties of starch measured by differential scanning calorimetry including reduced enthalpy, and increased gelatinization temperature. Despite the changes to starch properties, the starch contents of the mutant lines compared to wild types were within the normal range, as were grain weight and protein content. However, the concentration of total- and water-extractable arabinoxylan, and MLG, were increased in white flour compared to wild-type controls. HighlightLack of SS1 led to changes in starch molecular structure and thermal behavior. Starch content and grain weight were normal but cell wall polysaccharides in white flour were increased.

20
A stable genomic variant for photoperiodic flowering plasticity to enhance grain mold escape and yield stability in sorghum

Hodehou, D. A. T.; Diatta, C.; Bodian, S.; Ndour, M.; Sambakhe, D.; Sine, B.; Felderhoff, T.; Diouf, D.; Morris, G. P.; Kane, N. A.; Faye, J. M.

2026-04-04 genetics 10.64898/2026.04.01.715939 medRxiv
Top 0.2%
8.3%
Show abstract

Grain mold severely constrains sorghum [Sorghum bicolor (L.) Moench] productivity and grain quality in subhumid environments. Photoperiod-sensitive flowering plays a key role in mold avoidance and yield stability along north-south rainfall gradients. In response to the high susceptibility of elite cultivars in subhumid zones of Senegal, we developed and characterized a recombinant inbred line (RIL) population derived from Nganda (grain mold-susceptible) and Grinkan (photoperiod-sensitive) varieties. The population was evaluated across three distinct agro-ecological zones over two years. Environmental indices derived from genotype-environmental interactions, together with defined growth windows, strongly influenced flag leaf appearance (FLA), a photoperiodic flowering trait. Plasticity parameters (intercept and slope) for environmental indices, FLA, grain mold severity, and yield enabled identification of loci contributing to flowering response, mold resistance, and yield stability. The maturity gene Ma1 and two QTLs for FLA, qFLA6.2 and qFLA6.3, were identified, stable across environments, and colocalized with grain mold and yield QTLs. The wild-type Ma1 allele from Grinkan delayed FLA and reduced grain mold damage but was not associated with increased yield. The Ma1 effect was confirmed using the developed breeder-friendly KASP marker, Sbv3.1_06_40312464K, in 174 F3 three-way cross families. Photoperiod-sensitive lines with intermediate-to-late FLA alleles showed strong negative associations with mold damage. Overall, the identified stable loci and candidate lines provide foundations for effective molecular breeding of climate-resilient varieties. PLAIN LANGUAGE SUMMARYGrain mold is a fungal disease that reduces sorghum grain yield and quality, particularly in subhumid climates. With the limited number of resistant elite varieties, photoperiod-sensitive flowering to day length variation can contribute to grain mold escape at the end of rainy seasons. We characterized 286 sorghum recombinant inbred lines across three contrasting environments over two years along rainfall gradients in Senegal. Using flag leaf appearance (FLA), which is a photoperiodic flowering trait, strong genotype-environment interactions for FLA and genotypic plasticity were revealed. We identified and validated the common genomic locus associated with FLA variation and its plasticity across environments, the canonical maturity gene Ma1, which was influenced by temperature variation across environments. The presence of Ma1 in the background of photoperiod-sensitive lines enhances grain mold avoidance and yield stability along rainfall gradients in Senegal. CORE IDEASO_LIWe investigated photoperiodic flowering plasticity in sorghum as a contributor to grain mold resistance and yield stability along rainfall gradients. C_LIO_LIThe Maturity locus Ma1 (qFLA6.1) is the major contributor of photoperiodic flowering and its plasticity across semi-arid and subhumid environments. C_LIO_LIHybrid genotypes carrying two stable loci qFLA6.1 and qFLA6.2 sustain high grain mold avoidance in diverse environments. C_LIO_LIPhotoperiod-sensitive lines with medium to late flowering times are effective in avoiding grain mold, while maintaining yield stability in subhumid regions. C_LI